Post-Filter Optimization for Multichannel Automotive Speech Enhancement

نویسندگان

  • Huajun Yu
  • Jörg Schöbel
چکیده

In an automotive environment, quality of speech communication using a hands-free equipment is often deteriorated by interfering car noise. In order to preserve the speech signal without car noise, a multichannel speech enhancement system including a beamformer and a post-filter can be applied. Since employing a beamformer alone is insufficient to substantially reducing the level of car noise, a post-filter has to be applied to provide further noise reduction, especially at low frequencies. In this thesis, two novel post-filter designs along with their optimization for different driving conditions are presented. The first post-filter design utilizes an adaptive smoothing factor for the power spectral density estimation as well as a hybrid noise coherence function. The hybrid noise coherence function is a mixture of the diffuse and the measured noise coherence functions for a specific driving condition. The second post-filter design applies a new multichannel decisiondirected a priori SNR estimator based on both temporal and spatial smoothing. For different driving conditions, both post-filters are instrumentally optimized: For the first post-filter, the optimal adaptive smoothing factor and the optimal hybrid noise coherence function are obtained. For the second post-filter, the weighting factors of the temporal and spatial smoothing parts are optimized. Compared to state-of-the-art post-filters, both post-filter designs employing the optimized parameters improve the overall noise reduction performance significantly for different driving conditions. Generally, manually finding the optimal parameterization of a noise reduction algorithm is a time-consuming task. In this thesis, the two new post-filter designs are thus instrumentally optimized by using a figure of merit (FoM). We define the FoM as an entity, which comprises three independent instrumental measures for the speech component quality, the level of noise attenuation, and the amount of musical tones. Particularly, a new weighted log kurtosis ratio measure is proposed for instrumental musical tones assessment in a black-box test manner, which does not mandate any knowledge of internal variables of the noise reduction algorithm under test and can be applied to a wide range of noise reduction algorithms. Subjective listening tests reveal that the weighted log kurtosis ratio measurements can provide a high correlation to the perceived amount of musical tones. In addition, a single-channel application example of jointly optimizing the smoothing factor and the a priori SNR floor of the decision-directed a priori SNR estimation is shown using an FoM. For some noise reduction algorithms, yet unknown optimal values of the parameters of interest are identified by applying the FoM-based instrumental optimization method and subjectively verified.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An optimum microphone array post-filter for speech applications

This paper proposes a post-filtering estimation scheme for multichannel noise reduction. The proposed method extends and improves the existing Zelinski’s and, the most general and prominent, McCowan’s post-filtering methods that use the autoand crossspectral densities of the multichannel input signals to estimate the transfer function of the Wiener post-filter. A major drawback of these two spe...

متن کامل

A generalized estimation approach for linear and nonlinear microphone array post-filters

This paper presents a robust and general method for estimating the transfer functions of microphone array post-filters, derived under various speech enhancement criteria. For the case of the mean square error (MSE) criterion, the proposed method is an improvement of the existing McCowan post-filter, which under the assumption of a known noise field coherence function uses the autoand cross-spec...

متن کامل

A Novel Frequency Domain Linearly Constrained Minimum Variance Filter for Speech Enhancement

A reliable speech enhancement method is important for speech applications as a pre-processing step to improve their overall performance. In this paper, we propose a novel frequency domain method for single channel speech enhancement. Conventional frequency domain methods usually neglect the correlation between neighboring time-frequency components of the signals. In the proposed method, we take...

متن کامل

A Multichannel Feature Compensation Approach for Robust ASR in Noisy and Reverberant Environments

In this paper we propose a multichannel feature compensation approach for automatic speech recognition in reverberant and noisy environments. The proposed technique propagates the posterior of the clean signal estimated by a multichannel Wiener filter in short-time Fourier transform (STFT) domain into Mel-frequency cepstrum coefficients (MFCC) domain. The multichannel Wiener filter reduces both...

متن کامل

Multichannel MMSE Wiener Filter Using Complex Real and Imaginary Spectral Coefficients for Distributed Microphone Speech Enhancement

In this paper, the authors propose a frequency domain multichannel Wiener filter for distributed microphone speech enhancement using acoustic arrays. The current state-of-the-art single channel estimators achieve noticeable performance gains using the to-noise ratio (SNR) and segmental signal-to-noise ratio (SSNR) objective measures, which measure noise reduction, but only achieve marginal perf...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013